SEQUENCE LISTING

(1) GENERAL INFORMATION 

(i) APPLICANT: Deen, Keith C.
Dillon, Susan B.
Porter, Terence C.
Sweet, Raymond A.

(ii) TITLE OF THE INVENTION: Human Monoclonal Antibodies 

(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swedeland Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 19045 

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Diskette

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/030,149 

(B) FILING DATE: 01-NOV-1997

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Geiger, Kathleen 

(B) REGISTRATION NUMBER: 35,880

(C) REFERENCE/DOCKET NUMBER: P50504 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 610-270-5968 

(B) TELEFAX: 610-270-5090

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ser Ile Thr Gly Gly Ser Asn Gly Ile Asn Tyr Ala Asp Ser Val Lys
1               5                   10                  15
Arg

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Ser Ile Thr Gly Gly Ser Asn Gly Ile Asn Tyr Ser Asp Ser Val Lys
1               5                   10                  15
Arg



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ser Ile Thr Gly Gly Ser Asn Gly Ile Gln Tyr Ser Asp Ser Val Lys
1               5                   10                  15
Arg



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1               5                   10                  15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser Tyr
            20                  25                  30
Glu Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
        35                  40                  45
Ser Tyr Ile Ser Ser Ser Gly Ser Thr Ile Tyr Tyr Ala Asp Ser Val
    50                  55                  60
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65                  70                  75                  80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
                85                  90                  95
Ala Arg

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 139 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1               5                   10                  15
Val His Ser Glu Val Gln Leu Leu Glu Val Glu Ser Gly Gly Gly Leu
            20                  25                  30
Arg Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Thr Thr
        35                  40                  45
Leu Ser Gly Tyr Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly
    50                  55                  60
Leu Glu Trp Val Ser Ser Ile Thr Gly Gly Ser Asn Phe Ile Asn Tyr
65                  70                  75                  80
Ser Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys
                85                  90                  95
Asn Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Ala
            100                 105                 110
Val Tyr Tyr Cys Ala Thr Ala Pro Ile Ala Pro Pro Tyr Phe Asp His
        115                 120                 125
Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser
    130                 135



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Leu Leu Arg Gly
1               5                   10                  15
Val Gln Cys Gln Val Gln Leu Val Val Glu Ser Gly Gly Gly Leu Arg
            20                  25                  30
Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Thr Thr Leu
        35                  40                  45
Ser Gly Tyr Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
    50                  55                  60
Glu Trp Val Ser Ser Ile Thr Gly Gly Ser Asn Phe Ile Asn Tyr Ser
65                  70                  75                  80
Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn
                85                  90                  95
Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Ala Val
            100                 105                 110
Tyr Tyr Cys Ala Thr Ala Pro Ile Ala Pro Pro Tyr Phe Asp His Trp
        115                 120                 125
Gly Gln Gly Thr Leu Val Thr Val Ser Ser
    130                 135



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Leu Leu Arg Gly
1               5                   10                  15
Val Gln Cys Gln Val Gln Leu Val Val Glu Ser Gly Gly Gly Leu Arg
            20                  25                  30
Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Thr Thr Leu
        35                  40                  45
Ser Gly Tyr Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
    50                  55                  60
Glu Trp Val Ser Ser Ile Thr Gly Gly Ser Asn Phe Ile Asn Tyr Ala
65                  70                  75                  80
Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn
                85                  90                  95
Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Ala Val
            100                 105                 110
Tyr Tyr Cys Ala Thr Ala Pro Ile Ala Pro Pro Tyr Phe Asp His Trp
        115                 120                 125
Gly Gln Gly Thr Leu Val Thr Val Ser Ser
    130                 135

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Leu Leu Arg Gly
1               5                   10                  15
Val Gln Cys Gln Val Gln Leu Val Val Glu Ser Gly Gly Gly Leu Arg
            20                  25                  30
Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Thr Thr Leu
        35                  40                  45
Ser Gly Tyr Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
    50                  55                  60
Glu Trp Val Ser Ser Ile Thr Gly Gly Ser Asn Phe Ile Gln Tyr Ser
65                  70                  75                  80
Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn
                85                  90                  95
Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Ala Val
            100                 105                 110
Tyr Tyr Cys Ala Thr Ala Pro Ile Ala Pro Pro Tyr Phe Asp His Trp
        115                 120                 125
Gly Gln Gly Thr Leu Val Thr Val Ser Ser
    130                 135

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 88 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1               5                   10                  15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Ser Ile Ser Ser Tyr
            20                  25                  30
Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
        35                  40                  45
Tyr Ala Ala Ser Ser Leu Gln Ser Gly Val Pro Ser Arg Phe Ser Gly
    50                  55                  60
Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro
65                  70                  75                  80
Glu Asp Phe Ala Thr Tyr Tyr Cys
                85

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 124 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1               5                   10                  15
Val His Ser Glu Leu Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
            20                  25                  30
Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Thr Gln Ser Val Ser Asn
        35                  40                  45
Phe Leu Asn Trp Tyr Gln Gln Lys Pro Gly Glu Ala Pro Thr Leu Leu
    50                  55                  60
Ile Tyr Asp Ala Ser Thr Ser Gln Ser Gly Val Pro Ser Arg Phe Ser
65                  70                  75                  80
Gly Ser Gly Ser Gly Met Asp Phe Ser Leu Thr Ile Ser Ser Leu Gln
                85                  90                  95
Pro Glu Asp Leu Ala Met Tyr Tyr Cys Gln Ala Ser Ile Asn Thr Pro
            100                 105                 110
Leu Phe Gly Gly Gly Thr Arg Ile Asp Met Arg Arg
        115                 120

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 101 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Arg
1               5                   10                  15
Gly Ala Arg Cys Asp Ile Gln Met Asn Phe Leu Asn Trp Tyr Gln Gln
            20                  25                  30
Lys Pro Gly Glu Ala Pro Thr Leu Leu Ile Tyr Asp Ala Ser Thr Ser
        35                  40                  45
Gln Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Met Asp
    50                  55                  60
Phe Ser Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Leu Ala Met Tyr
65                  70                  75                  80
Tyr Cys Gln Ala Ser Ile Asn Thr Pro Leu Phe Gly Gly Gly Thr Arg
                85                  90                  95
Ile Asp Met Arg Arg
            100

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 106 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln
1               5                   10                  15
Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr
            20                  25                  30
Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser
        35                  40                  45
Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr
    50                  55                  60
Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys
65                  70                  75                  80
His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro
                85                  90                  95
Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
            100                 105

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 106 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln
1               5                   10                  15
Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr
            20                  25                  30
Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser
        35                  40                  45
Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr
    50                  55                  60
Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys
65                  70                  75                  80
His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Leu Pro
                85                  90                  95
Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
            100                 105

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6284 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60
AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120
GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180
ATGGTTGCTG ACTAATTGAG ATGCATCCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240
GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300
GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 360
GGATCGATCC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 420
CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480
CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 540
ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 600
GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 660
CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 720
TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 780
GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 840
TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 900
ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 960
AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 1020
TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGGTCCAA 1080
CTGCTCGAGG AGTCTGGGGG AGGCCTGGTC AGGCCTGGCG GGTCCCTAAG ACTCTCGTGT 1140
GCAGCCTCTG GAACCACCCT CAGTGGCTAT ACCATGCACT GGGTCCGCCA GGCTCCAGGG 1200
AAGGGGCTGG AGTGGGTCTC ATCCATTACT GGAGGTAGCA ACTTCATAAA CTACTCAGAC 1260
TCAGTGAAGG GCCGATTCAC CATCTCCAGA GACAACGCCA AGAACTCACT TTATCTGCAA 1320
ATGAACAGCC TGACAGCCGA GGACACGGCT GTCTATTATT GTGCGACCGC CCCTATAGCA 1380
CCGCCCTACT TTGACCACTG GGGCCAGGGA ACCCTGGTCA CCGTCTCCTC AGCCTCCACC 1440
AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 1500
GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACCGTGTC GTGGAACTCA 1560
GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 1620
TCCCTCAGCA GCGTGGTGAC TGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 1680
AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 1740
GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 1800
TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 1860
TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 1920
GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 1980
CGGGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 2040
TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 2100
GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 2160
AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 2220
TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 2280
GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 2340
AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 2400
CTCTCCCTGT CTCCGGGTAA ATGATAGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT 2460
CTAGTTGCCA GCCATCTGTT GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG 2520
CCACTCCCAC TGTCCTTTCC TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT 2580
GTCATTCTAT TCTGGGGGGT GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA 2640
ATAGCAGGCA TGCTGGGGAT GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCGCT 2700
GGATCTCCCG ATCCCCAGCT TTGCTTCTGA ATTTCTTATT TGCATAATGA GAAAAAAAGG 2760
AAAATTAATT TTAACACCAA TTCAGTAGTT GATTGAGCAA ATGCGTTGCC AAAAAGGATG 2820
CTTTAGAGAC AGTGTTCTCT GCACAGATAA GGACAAACAT TATTCAGAGG GAGTACCCAG 2880
AGCTGAGACT CCTAAGCCAG TGAGTGGCAC AGCATTCTAG GGAGAAATAT GCTTGTCATC 2940
ACCGAAGCCT GATTCCGTAG AGCCACACCT TGGTAAGGGC CAATCTGCTC ACACAGGATA 3000
GAGAGGGCAG GAGCCAGGGC AGAGCATATA AGGTGAGGTA GGATCAGTTG CTCCTCACAT 3060
TTGCTTCTGA CATAGTTGTG TTGGGAGCTT GGATAGCTTG GACAGCTCAG GGCTGCGATT 3120
TCGCGCCAAA CTTGACGGCA ATCCTAGCGT GAAGGCTGGT AGGATTTTAT CCCCGCTGCC 3180
ATCATGGTTC GACCATTGAA CTGCATCGTC GCCGTGTCCC AAAATATGGG GATTGGCAAG 3240
AACGGAGACC TACCCTGGCC TCCGCTCAGG AACGAGTTCA AGTACTTCCA AAGAATGACC 3300
ACAACCTCTT CAGTGGAAGG TAAACAGAAT CTGGTGATTA TGGGTAGGAA AACCTGGTTC 3360
TCCATTCCTG AGAAGAATCG ACCTTTAAAG GACAGAATTA ATATAGTTCT CAGTAGAGAA 3420
CTCAAAGAAC CACCACGAGG AGCTCATTTT CTTGCCAAAA GTTTGGATGA TGCCTTAAGA 3480
CTTATTGAAC AACCGGAATT GGCAAGTAAA GTAGACATGG TTTGGATAGT CGGAGGCAGT 3540
TCTGTTTACC AGGAAGCCAT GAATCAACCA GGCCACCTTA GACTCTTTGT GACAAGGATC 3600
ATGCAGGAAT TTGAAAGTGA CACGTTTTTC CCAGAAATTG ATTTGGGGAA ATATAAACTT 3660
CTCCCAGAAT ACCCAGGCGT CCTCTCTGAG GTCCAGGAGG AAAAAGGCAT CAAGTATAAG 3720
TTTGAAGTCT ACGAGAAGAA AGACTAACAG GAAGATGCTT TCAAGTTCTC TGCTCCCCTC 3780
CTAAAGCTAT GCATTTTTAT AAGACCATGG GACTTTTGCT GGCTTTAGAT CAGCCTCGAC 3840
TGTGCCTTCT AGTTGCCAGC CATCTGTTGT TTGCCCCTCC CCCGTGCCTT CCTTGACCCT 3900
GGAAGGTGCC ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT CGCATTGTCT 3960
GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG GGGAGGATTG 4020
GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT ATGGAACCAG CTGGGGCTCG 4080
ATCGAGTGTA TGACTGCGGC CGCGATCCCG TCGAGAGCTT GGCGTAATCA TGGTCATAGC 4140
TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA CAACATACGA GCCGGAAGCA 4200
TAAAGTGTAA AGCCTGGGGT GCCTAATGAG TGAGCTAACT CACATTAATT GCGTTGCGCT 4260
CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC 4320
GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC 4380
TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT 4440
TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG 4500
CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 4560
AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT 4620
ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA 4680
CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA TGCTCACGCT 4740
GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC 4800
CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 4860
GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG 4920
TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG 4980
TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT 5040
GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA 5100
CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 5160
AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA 5220
CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA 5280
CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT 5340
TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT 5400
TACCATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 5460
TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 5520
CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 5580
ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 5640
GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT 5700
TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 5760
CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG 5820
TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC 5880
GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA 5940
CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC 6000
CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 6060
TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 6120
GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA 6180
GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA 6240
AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCT 6284

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1               5                   10                  15
Val His Ser Glu Val Gln Leu Leu Glu Val
            20                  25

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Ser Pro Gly Lys
1

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1               5                   10                  15
Val His Ser Glu Leu Thr Gln Ser Pro
            20                  25

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5681 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:



GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60
AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120
GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180
ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240
GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300
GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 360
GGATCGATCC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 420
CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480
CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 540
ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 600
GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 660
CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 720
TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 780
GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 840
TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 900
ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 960
AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 1020
TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGCTCACC 1080
CAGTCTCCAT CCTCCCTGTC TGCATCTGTA GGAGACAGAG TCACCATCAC TTGCCGGGCA 1140
ACTCAGAGTG TTAGTAACTT TTTAAATTGG TATCAGCAGA AGCCAGGGGA AGCCCCTACG 1200
CTCCTGATCT ATGATGCATC CACTTCGCAA AGTGGGGTCC CATCAAGGTT CAGTGGCAGT 1260
GGATCTGGGA TGGATTTCAG TCTCACCATC AGCAGTCTGC AGCCTGAAGA TCTTGCAATG 1320
TATTACTGTC AAGCGAGTAT CAATACCCCG CTTTTCGGCG GAGGGACCAG AATAGATATG 1380
AGACGAACTG TGGCTGCACC ATCTGTCTTC ATCTTCCCGC CATCTGATGA GCAGTTGAAA 1440
TCTGGAACTG CCTCTGTTGT GTGCCTGCTG AATAACTTCT ATCCCAGAGA GGCCAAAGTA 1500
CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC AGGAGAGTGT CACAGAGCAG 1560
GACAGCAAGG ACAGCACCTA CAGCCTCAGC AGCACCCTGA CGCTGAGCAA AGCAGACTAC 1620
GAGAAACACA AAGTCTACGC CTGCGAAGTC ACCCATCAGG GCCTGAGCTT GCCCGTCACA 1680
AAGAGCTTCA ACAGGGGAGA GTGTTAGTGA GATGATCCTC TAGAGTCATC TACGTATGAT 1740
CAGCCTCGAC TGTGCCTTCT AGTTGCCAGC CATCTGTTGT TTGCCCCTCC CCCGTGCCTT 1800
CCTTGACCCT GGAAGGTGCC ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT 1860
CGCATTGTCT GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG 1920
GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT ATGGAACCAG 1980
CTGGGGCTCG ACAGCTCGAG CTAGCTTTGC TTCTCAATTT CTTATTTGCA TAATGAGAAA 2040
AAAAGGAAAA TTAATTTTAA CACCAATTCA GTAGTTGATT GAGCAAATGC GTTGCCAAAA 2100
AGGATGCTTT AGAGACAGTG TTCTCTGCAC AGATAAGGAC AAACATTATT CAGAGGGAGT 2160
ACCCAGAGCT GAGACTCCTA AGCCAGTGAG TGGCACAGCA TTCTAGGGAG AAATATGCTT 2220
GTCATCACCG AAGCCTGATT CCGTAGAGCC ACACCTTGGT AAGGGCCAAT CTGCTCACAC 2280
AGGATAGAGA GGGCAGGAGC CAGGGCAGAG CATATAAGGT GAGGTAGGAT CAGTTGCTCC 2340
TCACATTTGC TTCTGACATA GTTGTGTTGG GAGCTTGGAT CGATCCACCA TGGTTGAACA 2400
AGATGGATTG CACGCAGGTT CTCCGGCCGC TTGGGTGGAG AGGCTATTCG GCTATGACTG 2460
GGCACAACAG ACAATCGGCT GCTCTGATGC CGCCGTGTTC CGGCTGTCAG CGCAGGGGCG 2520
CCCGGTTCTT TTTGTCAAGA CCGACCTGTC CGGTGCCCTG AATGAACTGC AGGACGAGGC 2580
AGCGCGGCTA TCGTGGCTGG CCACGACGGG CGTTCCTTGC GCAGCTGTGC TCGACGTTGT 2640
CACTGAAGCG GGAAGGGACT GGCTGCTATT GGGCGAAGTG CCGGGGCAGG ATCTCCTGTC 2700
ATCTCACCTT GCTCCTGCCG AGAAAGTATC CATCATGGCT GATGCAATGC GGCGGCTGCA 2760
TACGCTTGAT CCGGCTACCT GCCCATTCGA CCACCAAGCG AAACATCGCA TCGAGCGAGC 2820
ACGTACTCGG ATGGAAGCCG GTCTTGTCGA TCAGGATGAT CTGGACGAAG AGCATCAGGG 2880
GCTCGCGCCA GCCGAACTGT TCGCCAGGCT CAAGGCGCGC ATGCCCGACG GCGAGGATCT 2940
CGTCGTGACC CATGGCGATG CCTGCTTGCC GAATATCATG GTGGAAAATG GCCGCTTTTC 3000
TGGATTCATC GACTGTGGCC GGCTGGGTGT GGCGGACCGC TATCAGGACA TAGCGTTGGC 3060
TACCCGTGAT ATTGCTGAAG AGCTTGGCGG CGAATGGGCT GACCGCTTCC TCGTGCTTTA 3120
CGGTATCGCC GCTCCCGATT CGCAGCGCAT CGCCTTCTAT CGCCTTCTTG ACGAGTTCTT 3180
CTGAGCGGGA CTCTGGGGTT CGAAATGACC GACCAAGCGA CGCCCAACCT GCCATCACGA 3240
GATTTCGATT CCACCGCCGC CTTCTATGAA AGGTTGGGCT TCGGAATCGT TTTCCGGGAC 3300
GCCGGCTGGA TGATCCTCCA GCGCGGGGAT CTCATGCTGG AGTTCTTCGC CCACCCCAAC 3360
TTGTTTATTG CAGCTTATAA TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT 3420
AAAGCATTTT TTTCACTGCA TTCTAGTTGT GGTTTGTCCA AACTCATCAA TGTATCTTAT 3480
CATGTCTGGA TCGCGGCCGC GATCCCGTCG AGAGCTTGGC GTAATCATGG TCATAGCTGT 3540
TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA CATACGAGCC GGAAGCATAA 3600
AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC ATTAATTGCG TTGCGCTCAC 3660
TGCCCGCTTT CCAGTCGGGA AACCTGTCGT GCCAGCTGCA TTAATGAATC GGCCAACGCG 3720
CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 3780
GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 3840
CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA 3900
GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 3960
ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA TAAAGATACC 4020
AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG 4080
GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCAATGC TCACGCTGTA 4140
GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 4200
TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 4260
ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG AGGTATGTAG 4320
GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT 4380
TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 4440
CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC 4500
GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT GACGCTCAGT 4560
GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG ATCTTCACCT 4620
AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT 4680
GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC TGTCTATTTC 4740
GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC TACGATACGG GAGGGCTTAC 4800
CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT 4860
CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ACTTTATCCG 4920
CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG CCAGTTAATA 4980
GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT GTCACGCTCG TCGTTTGGTA 5040
TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT 5100
GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG 5160
TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG CCATCCGTAA 5220
GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC 5280
GACCGAGTTG CTCTTGCCCG GCGTCAATAC GGGATAATAC CGCGCCACAT AGCAGAACTT 5340
TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATCTTACCGC 5400
TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA 5460
CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 5520
TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA 5580
TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 5640
AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCCACC T 5681



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Leu Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1427 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



GAATTCGGTA 


CCATGGAGTT 


TGGGCTGAGC 


TGGGTTTTCC 


TCGTGGCTCT 


TTTAAGAGGT 


GTCCAGTGTC 


AGGTGCAGCT 


GGTGGAGTCT 


GGGGGAGGCC 


TGGTCAGGCC 


TGGCGGGTCC 


CTAAGACTCT 


CGTGTGCAGC 


CTCTGGAACC 


ACCCTCAGTG 


GCTATACCAT 


GCACTGGGTC 


CGCCAGGCTC 


CAGGGAAGGG 


GCTGGAGTGG 


GTCTCATCCA 


TTACTGGAGG 


TAGCAACTTC 


ATAAACTACT 


CAGACTCAGT 


GAAGGGCCGA 


TTCACCATCT 


CCAGAGACAA 


CGCCAAGAAC 


TCACTTTATC 


TGCAAATGAA 


CAGCCTGACA 


GCCGAGGACA 


CGGCTGTCTA 


TTATTGTGCG 


ACCGCCCCTA 


TAGCACCGCC 


CTACTTTGAC 


CACTGGGGCC 


AGGGAACCCT 


GGTCAGCGTC 


TCCTCAGCCT 


CCACCAAGGG 


CCCATCGGTC 


TTCCCCCTGG 


CACCCTCCTC 


CAAGAGCACC 


TCTGGGGGCA 


CAGCGGCCCT 


GGGCTGCCTG 


GTCAAGGACT 


ACTTCCCCGA 


ACCGGTGACC 


GTGTCGTGGA 


ACTCAGGCGC 


CCTGACCAGC 


GGCGTGCACA 


CCTTCCCGGC 


TGTCCTACAG 


TCCTCAGGAC 


TCTACTCCCT 


CAGCAGCGTG 


GTGACCGTGC 


CCTCCAGCAG 


CTTGGGCACC 


CAGACCTACA 


TCTGCAACGT 


GAATCACAAG 


CCCAGCAACA 


CCAAGGTGGA 


CAAGAAAGTT 


GAGCCCAAAT 


CTTGTGACAA 


AACTCACACA 


TGCCCACCGT 


GCCCAGCACC 


TGAACTCCTG 


GGGGGACCGT 


CAGTCTTCCT 


CTTCCCCCCA 


AAACCCAAGG 


ACACCCTCAT 


GATCTGCCGG 


ACCCCTGAGG 


TCACATGCGT 


GGTGGTGGAC 


GTGAGCCACG 


AAGACCCTGA 


GGTCAAGTTC 


AACTGGTACG 


TGGACGGCGT 


GGAGGTGCAT 


AATGCCAAGA 


CAAAGCCGCG 


GGAGGAGCAG 


TACAACAGCA 


CGTACCGGGT 


GGTCAGCGTC 


CTCACCGTCC 


TGCACCAGGA 


CTGGCTGAAT 


GGCAAGGAGT 


ACAAGTGCAA 


GGTCTCCAAC


AAAGCCCTCC 


CAGCCCCCAT 


CGAGAAAACC 


ATCTCCAAAG 


CCAAAGGGCA 


GCCCCGAGAA 


CCACAGGTGT 


ACACCCTGCC 


CCCATCCCGG 


GATGAGCTGA 


CCAAGAACCA 


GGTCAGCCTG 


ACCTGCCTGG 


TCAAAGGCTT 


CTATCCCAGC 


GACATCGCCG 


TGGAGTGGGA 


GAGCAATGGG 


CAGCCGGAGA 


ACAACTACAA 


GACCACGCCT 


CCCGTGCTGG 


ACTCCGACGG 


CTCCTTCTTC 


CTCTACAGCA 


AGCTCACCGT 


GGACAAGAGC 


AGGTGGCAGC 


AGGGGAACGT 


CTTCTCATGC 


TCCGTGATGC 


ATGAGGCTCT 


GCACAACCAC 


TACACGCAGA 


AGAGCCTCTC 


CCTGTCTCCG 


GGTAAATGAT 


AGATATC 





(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Leu Leu Arg Gly 

1 5 10 15

Val Gln Cys Gln Val Gln Leu Val
20 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GAATTCCATG 


GACATGAGGG 


TCCCCGCTCA


GCTCCTAGGG


CTCCTGCTGC


TCTGGCTCCG


60


AGGTGCCAGA


TGTGACATCC


AGATGACCCA


CTCTCCATCC 


TCCCTGTCTG 


CATCTGTAGG 


120 


AGACAGAGTC 


ACCATCACTT 


GCCGGGCAAC 


TCAGAGTGTT 


AGTAACTTTT 


TAAATTGGTA 


180 


TCAGCAGAAG 


CCAGGGGAAG 


CCCCTACGCT 


CCTGATCTAT 


GATGCATCCA 


CTTCGCAAAG 


240 


TGGGGTCCCA 


TCAAGGTTCA 


GTGGCAGTGG 


ATCTGGGATG 


GATTTCAGTC 


TCACCATCAG 


300 


CAGTCTGCAG 


CCTGAAGATC 


TTGCAATGTA 


TTACTGTCAA 


GCGAGTATCA 


ATACCCCGCT 


360 


TTTCGGCGGA 


GGGACCAGAA 


TAGATATGAG 


ACGAACTGTG 


GCTGCACCAT 


CTGTCTTCAT 


420 


CTTCCCGCCA 


TCTGATGAGC 


AGTTGAAATC 


TGGAACTGCC 


TCTGTTGTGT 


GCCTGCTGAA 


480 


TAACTTCTAT 


CCCAGAGAGG 


CCAAAGTACA 


GTGGAAGGTG 


GATAACGCCC 


TCCAATCGGG 


540 


TAACTCCCAG 


GAGAGTGTCA 


CAGAGCAGGA 


CAGCAAGGAC 


AGCACCTACA 


GCCTCAGCAG 


600 


CACCCTGACG 


CTGAGCAAAG 


CAGACTACGA 


GAAACACAAA 


GTCTACGCCT 


GCGAAGTCAC 


660 


CCATCAGGGC 


CTGAGCTTGC 


CCGTCACAAA 


GAGCTTCAAC 


AGGGGAGAGT 


GTTAGTGAGA 


720
TGATCCTCTA GA

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Arg

1 5 10 15

Gly Ala Arg Cys Asp Ile Gln Met Thr
20 25 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Leu Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



GAATTCGGTA 


CCATGGAGTT 


TGGGCTGAGC 


TGGGTTTTCC 


TCGTGGCTCT 


TTTAAGAGGT 


60 


GTCCAGTGTC 


AGGTGCAGCT 


GGTGGAGTCT 


GGGGGAGGCC 


TGGTCAGGCC 


TGGCGGGTCC 


120 


CTAAGACTCT 


CGTGTGCAGC 


CTCTGGAACC 


ACCCTCAGTG 


GCTATACCAT 


GCACTGGGTC 


180 


CGCCAGGCTC 


CAGGGAAGGG 


GCTGGAGTGG 


GTCTCATCCA 


TTACTGGAGG 


TAGCAACTTC 


240 


ATAAACTACG 


CAGACTCAGT 


GAAGGGCCGA 


TTCACCATCT 


CCAGAGACAA 


CGCCAAGAAC 


300 


TCACTTTATC 


TGCAAATGAA 


CAGCCTGACA 


GCCGAGGACA 


CGGCTGTCTA 


TTATTGTGCG 


360 


ACCGCCCCTA 


TAGCACCGCC 


CTACTTTGAC 


CACTGGGGCC 


AGGGAACCCT 


GGTCAGCGTC 


420 


TCCTCAGCCT 


CCACCAAGGG 


CCCATCGGTC 


TTCCCCCTGG 


CACCCTCCTC 


CAAGAGCACC 


480 


TCTGGGGGCA 


CAGCGGCCCT 


GGGCTGCCTG 


GTCAAGGACT 


ACTTCCCCGA 


ACCGGTGACC 


540 


GTGTCGTGGA 


ACTCAGGCGC 


CCTGACCAGC 


GGCGTGCACA 


CCTTCCCGGC 


TGTCCTACAG 


600 


TCCTCAGGAC 


TCTACTCCCT 


CAGCAGCGTG 


GTGACCGTGC 


CCTCCAGCAG 


CTTGGGCACC 


660 


CAGACCTACA 


TCTGCAACGT 


GAATCACAAG 


CCCAGCAACA 


CCAAGGTGGA 


CAAGAAAGTT 


720 


GAGCCCAAAT 


CTTGTGACAA 


AACTCACACA 


TGCCCACCGT 


GCCCAGCACC 


TGAACTCCTG 


780 


GGGGGACCGT 


CAGTCTTCCT 


CTTCCCCCCA 


AAACCCAAGG


ACACCCTCAT 


GATCTCCCGG 


840 


ACCCCTGAGG 


TCACATGCGT 


GGTGGTGGAC 


GTGAGCCACG 


AAGACCCTGA 


GGTCAAGTTC 


900 


AACTGGTACG 


TGGACGGCGT 


GGAGGTGCAT


AATGCCAAGA 


CAAAGCCGCG 


GGAGGAGCAG 


960 


TACAACAGCA 


CGTACCGGGT 


GGTCAGCGTC 


CTCACCGTCC 


TGCACCAGGA 


CTGGCTGAAT 


1020 


GGCAAGGAGT 


ACAAGTGCAA 


GGTCTCCAAC 


AAAGCCCTCC 


CAGCCCCCAT 


CGAGAAAACC 


1080 


ATCTCCAAAG 


CCAAAGGGCA 


GCCCCGAGAA 


CCACAGGTGT 


ACACCCTGCC 


CCCATCCCGG 


1140 


GATGAGCTGA 


CCAAGAACCA 


GGTCAGCCTG 


ACCTGCCTGG 


TCAAAGGCTT 


CTATCCCAGC 


1200 


GACATCGCCG 


TGGAGTGGGA 


GAGCAATGGG 


CAGCCGGAGA 


ACAACTACAA 


GACCACGCCT 


1260 


CCCGTGCTGG 


ACTCCGACGG 


CTCCTTCTTC 


CTCTACAGCA 


AGCTCACCGT 


GGACAAGAGC 


1320 


AGGTGGCAGC 


AGGGGAACGT 


CTTCTCATGC 


TCCGTGATGC 


ATGAGGCTCT 


GCACAACCAC 


1380 


TACACGCAGA 


AGAGCCTCTC 


CCTGTCTCCG 


GGTAAATGAT 


AGATATC 




1427 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Ser Asn Phe Ile Asn Tyr Ala
1 5 



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1427 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



GAATTCGGTA 


CCATGGAGTT 


TGGGCTGAGC


TGGGTTTTCC 


TCGTGGCTCT 


TTTAAGAGGT 


60 


GTCCAGTGTC 


AGGTGCAGCT 


GGTGGAGTCT 


GGGGGAGGCC 


TGGTCAGGCC 


TGGCGGGTCC 


120 


CTAAGACTCT 


CGTGTGCAGC 


CTCTGGAACC 


ACCCTCAGTG 


GCTATACCAT 


GCACTGGGTC 


180 


CGCCAGGCTC 


CAGGGAAGGG 


GCTGGAGTGG 


GTCTCATCCA 


TTACTGGAGG 


TAGCAACTTC 


240 


ATACAATACT 


CAGACTCAGT 


GAAGGGCCGA 


TTCACCATCT 


CCAGAGACAA 


CGCCAAGAAC 


300 


TCACTTTATC 


TGCAAATGAA 


CAGCCTGACA 


GCCGAGGACA 


CGGCTGTCTA 


TTATTGTGCG 


360 


ACCGCCCCTA 


TAGCACCGCC 


CTACTTTGAC 


CACTGGGGCC 


AGGGAACCCT 


GGTCACCGTC 


420 


TCCTCAGCCT 


CCACCAAGGG 


CCCATCGGTC 


TTCCCCCTGG 


CACCCTCCTC 


CAAGAGCACC 


480 


TCTGGGGGCA 


CAGCGGCCCT 


GGGCTGCCTG 


GTCAAGGACT 


ACTTCCCCGA 


ACCGGTGACC 


540 


GTGTCGTGGA 


ACTCAGGCGC 


CCTGACCAGC 


GGCGTGCACA 


CCTTCCCGGC 


TGTCCTACAG 


600 


TCCTCAGGAC 


TCTACTCCCT 


CAGCAGCGTG 


GTGACCGTGC 


CCTCCAGCAG 


CTTGGGCACC 


660 


CAGACCTACA 


TCTGCAACGT 


GAATCACAAG 


CCCAGCAACA 


CCAAGGTGGA 


CAAGAAAGTT 


720


AACTCACACA


TGCCCACCGT 


GCCCAGCACC 


TGAACTCCTG 


780 






CTTCCCCCCA 


AAACCCAAGG 


ACACCCTCAT 


GATCTCCCGG 


840 






GGTGGTGGAC 


GTGAGCCACG 


AAGACCCTGA 


GGTCAAGTTC 


900 


AACTGGTACG


TGGACGGCGT


GGAGGTGCAT 


AATGCCAAGA 


CAAAGCCGCG 


GGAGGAGCAG 


960 




CGTACCGGGT


GGTCAGCGTC


CTCACCGTCC 


TGCACCAGGA 


CTGGCTGAAT 


1020 


GGCAAGGAGT 


ACAAGTGCAA 


GGTCTCCAAC 


AAAGCCCTCC 


CAGCCCCCAT 


CGAGAAAACC 


1080 


ATCTCCAAAG 


CCAAAGGGCA 


GCCCCGAGAA 


CCACAGGTGT 


ACACCCTGCC 


CCCATCCCGG 


1140 


GATGAGCTGA 


CCAAGAACCA 


GGTCAGCCTG 


ACCTGCCTGG 


TCAAAGGCTT 


CTATCCCAGC 


1200 


GACATCGCCG 


TGGAGTGGGA 


GAGCAATGGG 


CAGCCGGAGA 


ACAACTACAA 


GACCACGCCT 


1260 


CCCGTGCTGG 


ACTCCGACGG 


CTCCTTCTTC 


CTCTACAGCA 


AGCTCACCGT 


GGACAAGAGC 


1320 


AGGTGGCAGC 


AGGGGAACGT 


CTTCTCATGC 


TCCGTGATGC 


ATGAGGCTCT 


GCACAACCAC 


1380 


TACACGCAGA 


AGAGCCTCTC 


CCTGTCTCCG 


GGTAAATGAT 


AGATATC 




1427 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ser Asn Phe Ile Gln Tyr Ser
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:



GAATTCCATG 


GACATGAGGG 


TCCCCGCTCA 


GCTCCTAGGG 


CTCCTGCTGC 


TCTGGCTCCG 


60 


AGGTGCCAGA 


TGTGACATCC 


AGATGACCCA 


GTCTCCATCC 


TCCCTGTCTG 


CATCTGTAGG 


120 


AGACAGAGTC 


ACCATCACTT 


GCCGGGCAAC 


TCAGAGTGTT 


AGTAACTTTT 


TAAATTGGTA 


180 


TCAGCAGAAG 


CCAGGGGAAG 


CCCCTACGCT 


CCTGATCTAT 


GATGCATCCA 


CTTCGCAAAG 


240 


TGGGGTCCCA 


TCAAGGTTCA 


GTGGCAGTGG 


ATCTGGGATG 


GATTTCAGTC 


TCACCATCAG 


300 


CAGTCTGCAG 


CCTGAAGATC 


TTGCAATGTA 


TTACTGTCAA 


GCGAGTATCA 


ATACCCCGCT 


360 


TTTCGGCGGA 


GGGACCAGAA 


TAGATATGAG 


ACGAACTGTG 


GCTGCACCAT 


CTGTCTTCAT 


420 


CTTCCCGCCA 


TCTGATGAGC 


AGTTGAAATC 


TGGAACTGCC 


TCTGTTGTGT 


GCCTGCTGAA 


480 


TAACTTCTAT 


CCCAGAGAGG 


CCAAAGTACA 


GTGGAAGGTG 


GATAACGCCC 


TCCAATCGGG 


540 


TAACTCCCAG 


GAGAGTGTCA 


CAGAGCAGGA 


CAGCAAGGAC 


AGCACCTACA 


GCCTCAGCAG 


600 


CACCCTGACG 


CTGAGCAAAG 


CAGACTACGA 


GAAACACAAA 


GTCTACGCCT 


GCGAAGTCAC 


660 


CCATCAGGGC 


CTGAGCTCGC 


CCGTCACAAA 


GAGCTTCAAC 


AGGGGAGAGT 


GTTAGTGAGA 


720 


TGATCCTCTA 


GATCTACGTA 


TGATCAGCCT 


CGACTGTGCC 


TT 




762 






(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ser Pro Val Thr Lys Ser Phe Thr Arg Gly Gln Cys
1 5 10